Discovering Potential Terminological Relationships from Twitter's Timed Content

Authors

  • Mohammad Daoud
  • Daoud Daoud
Abstract

This paper presents a method for discovering possible terminological relationships from tweets. We match the histories of terms, that is, their frequency patterns over time; similar histories indicate a possible relationship between terms. For example, if two terms (t1, t2) appear frequently on Twitter on particular days, and their frequencies are similar over a period of time, then t1 and t2 may be related. Maintaining a standard terminological repository with up-to-date relationships is difficult, especially in a dynamic domain such as social media, where thousands of new terms (neologisms) are coined every day. We therefore propose constructing a raw repository of lexical units with unconfirmed relationships. We tested our method on time-sensitive Arabic terms used by the online Arabic community on Twitter. We draw relationships between these terms by matching their frequency patterns (timelines), using dynamic time warping as the similarity measure. For evaluation, we selected 630 candidate terms (which we call preterms) and compared their timelines over a period of 30 days. Around 270 correct relationships were discovered, with a precision of 0.61. These relationships were extracted without considering the textual context of the terms.
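The timeline matching described in the abstract can be illustrated with a minimal sketch. It uses the textbook dynamic time warping recurrence on daily frequency counts; the normalization step, the threshold, and the function names (`dtw_distance`, `normalize`, `candidate_relationships`) are assumptions made for illustration and are not taken from the paper, which does not give these details here.

```python
from itertools import combinations


def dtw_distance(a, b):
    """Textbook dynamic time warping distance between two numeric sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of: step in a only, step in b only, step in both
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def normalize(series):
    """Scale counts to sum to 1 so that the shape matters, not the volume.

    Assumption: the paper does not state how (or whether) timelines are scaled.
    """
    total = sum(series)
    return [x / total for x in series] if total else list(series)


def candidate_relationships(timelines, threshold):
    """Return (term1, term2, distance) for pairs whose timelines are close.

    `timelines` maps each preterm to its list of daily frequencies.
    `threshold` is a hypothetical cut-off chosen by the caller.
    """
    pairs = []
    for (t1, s1), (t2, s2) in combinations(sorted(timelines.items()), 2):
        dist = dtw_distance(normalize(s1), normalize(s2))
        if dist <= threshold:
            pairs.append((t1, t2, dist))
    return sorted(pairs, key=lambda p: p[2])
```

For example, calling `candidate_relationships({"a": [0, 5, 0, 0], "b": [0, 0, 5, 0], "c": [5, 5, 5, 5]}, threshold=0.1)` pairs `a` with `b`, whose peaks differ only by a one-day shift that warping absorbs, and leaves out the flat timeline `c`. Pairs found this way are only candidates: as the abstract notes, the relationships remain unconfirmed.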

Similar resources

Research Paper: Using Semantic and Structural Properties of the Unified Medical Language System to Discover Potential Terminological Relationships

OBJECTIVE To use the semantic and structural properties in the Unified Medical Language System (UMLS) Metathesaurus to characterize and discover potential relationships. DESIGN The UMLS integrates knowledge from several biomedical terminologies. This knowledge can be used to discover implicit semantic relationships between concepts. In this paper, the authors propose a problem-independent app...

Discovering Temporal Knowledge from a Crisscross of Timed Observations

This paper is concerned with discovering temporal knowledge from a sequence of timed observations provided by a system monitoring a dynamic process. The discovery process is based on the Stochastic Approach framework, where a series of timed observations is represented with a Markov chain. From this representation, a set of timed sequential binary relations between discrete event class...

Using Semantic and Structural Properties of the Unified Medical Language System to Discover Potential Terminological Relationships

Design: The UMLS integrates knowledge from several biomedical terminologies. This knowledge can be used to discover implicit semantic relationships between concepts. In this paper, the authors propose a problem-independent approach for discovering potential terminological relationships that employs semantic abstraction of indirect relationship paths to perform classification and analysis of netw...

Bridging the Semantic Gap: Exploring Descriptive Vocabulary for Image Structure

This poster summarizes the methodology and results from an experiment to collect terms used by subjects in the verbal description of images from the domains of abstract art, satellite imagery and photo-microscopy. The resulting natural language vocabulary was then analyzed to identify a set of concepts that were shared across subjects. These concepts were subsequently organized as a faceted voc...

Discovering N-ary Timed Relations from Sequences - a Position Paper

The goal of this position paper is to show the problems with most known timed data mining techniques for discovering temporal knowledge from a set of timed message sequences. Two of the main problems with these techniques are the sensitivity of the algorithms to the values of their parameters and the excessive number of discovered patterns. We argue that these problems can be avoided using anoth...

Publication date: 2016